Auditory model inversion for sound separation

نویسندگان

Malcolm Slaney

Daniel Naar

Richard F. Lyon

چکیده

1 Techniques to recreate sounds from perceptual displays known as cochleagrams and correlograms are developed using a convex projection framework. Prior work on cochlear-model inversion is extended to account for rectiÞcation and gain adaptation. A prior technique for phase recovery in spectrogram inversion is combined with the synchronized overlap-and-add technique of speech rate modiÞcation, and is applied to inverting the short-time autocorrelation function representation in the auditory correlogram. Improved methods of initial phase estimation are explored. A range of computational cost options, with and without iteration, produce a range of quality levels from fair to near perfect.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Lyon’s Auditory Model Inversion: a Tool for Sound Separation and Speech Enhancement

A new implementation of Lyon’s Auditory Model and an optimised inversion procedure will be presented. Both the passive and active Lyon’s cochlea models were studied as new signal processing analysis schemes, while only the first one was considered regarding the inversion procedure. Following the work of M. Slaney, sound resynthesis was obtained inverting the correlogram representation by a new ...

متن کامل

Pattern Playback in the 90s

Deciding the appropriate representation to use for modeling human auditory processing is a critical issue in auditory science. While engineers have successfully performed many single-speaker tasks with LPC and spectrogram methods, more difficult problems will need a richer representation. This paper describes a powerful auditory representation known as the correlogram and shows how this non-lin...

متن کامل

A Quantitative Evaluation of a Bio-inspired Sound Segregation Technique for Two- and Three-Source Mixtures

A sound source separation technique based on a bio-inspired neural network, capable of functioning in more than two-source mixtures, is proposed. Separation results are compared with other proposed techniques in the literature using quantitative evaluation criteria. 1 The sound source separation problem In our life we are confronted to situation in which a mixture of sound sources is present in...

متن کامل

The History and Future of CASA

1 INTRODUCTION In this chapter I briefly review the history and the future of computational auditory scene analysis (CASA). Auditory scene analysis describes the process we use to understand the world around us. Our two ears hear a cacophony of sounds and understand that the periodic tic-toc comes from a clock, the singing voice comes from a radio and the steady hum is coming from the refrigera...

متن کامل

Rate Versus Temporal Code? A Spatio-Temporal Coherence Model of the Cortical Basis of Streaming

Abstract A better understanding of auditory scene analysis requires uncovering the brain processes that govern the segregation of sound patterns into perceptual streams. Existing models of auditory streaming emphasize tonotopic or “spatial” separation of neural responses as the primary determinant of stream segregation. While partially true, this theory is far from complete. It overlooks the in...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1994

Auditory model inversion for sound separation

نویسندگان

چکیده

منابع مشابه

Lyon’s Auditory Model Inversion: a Tool for Sound Separation and Speech Enhancement

Pattern Playback in the 90s

A Quantitative Evaluation of a Bio-inspired Sound Segregation Technique for Two- and Three-Source Mixtures

The History and Future of CASA

Rate Versus Temporal Code? A Spatio-Temporal Coherence Model of the Cortical Basis of Streaming

عنوان ژورنال:

اشتراک گذاری